# Lightweight quantization

Smolvlm2 2.2B Instruct I1 GGUF

SmolVLM2-2.2B-Instruct is a 2.2B-parameter vision-language model focused on video-text-to-text tasks. It supports English.

Llama 3 VNTL Yollisa 8B I1 GGUF

This is a weighted/imatrix-quantized version of Casual-Autopsy/Llama-3-VNTL-Yollisa-8B. It handles English and Japanese and specifically targets Japanese media, otaku media, and visual novels (VNs).

Large Language Model Supports Multiple Languages

Gte Qwen2 1.5B Instruct GGUF

A quantized version of Alibaba-NLP/gte-Qwen2-1.5B-instruct, primarily used for sentence similarity and text embedding tasks.

Large Language Model English

Gemma 2 Baku 2b It GGUF

This is a GGUF conversion of the gemma-2-baku-2b-it model from rinna, produced with K-quantization and iMatrix.

Large Language Model

Transformers Supports Multiple Languages

GPT NeoX 1.3B Viet Final GGUF

A 1.3B-parameter GPT-NeoX model pretrained on 31.3 GB of Vietnamese data.

Large Language Model English

© 2025 AIbase